Workspace setup and dataset loading

Elbow method to find the optimal k (number of clusters)

By the elbow method, this clustering is optimal when 4 clusters are used. The following clustering experiments use k-means and k-medoids with k = 4.
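A minimal sketch of the elbow computation, assuming scikit-learn is available. The notebook's dataset is not shown here, so synthetic blobs from `make_blobs` stand in for it; the range of k values is also an assumption. The idea is to fit k-means for several k and look for the point where the within-cluster sum of squares (inertia) stops dropping sharply.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

# Synthetic stand-in for the notebook's dataset (assumption): 4 Gaussian blobs in 2-D.
X, _ = make_blobs(n_samples=400, centers=4, cluster_std=0.6, random_state=0)

# Fit k-means for each candidate k and record the inertia
# (sum of squared distances from each point to its assigned centroid).
ks = list(range(1, 9))
inertias = []
for k in ks:
    model = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X)
    inertias.append(model.inertia_)

# The "elbow" is the k after which the inertia decreases only slowly.
for k, inertia in zip(ks, inertias):
    print(f"k={k}: inertia={inertia:.1f}")
```

Plotting `ks` against `inertias` gives the usual elbow curve; on real data the bend is often less clear-cut than on synthetic blobs.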

Apply k-means clustering with k=4
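A short sketch of this step, assuming scikit-learn's `KMeans` and, again, synthetic blobs in place of the notebook's dataset. The `n_init` and `random_state` values are illustrative choices, not taken from the original.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

# Synthetic stand-in for the notebook's dataset (assumption).
X, _ = make_blobs(n_samples=400, centers=4, cluster_std=0.6, random_state=0)

# Fit k-means with k = 4 and get one cluster label per sample.
kmeans = KMeans(n_clusters=4, n_init=10, random_state=0)
labels = kmeans.fit_predict(X)

# Each row is the centroid of one cluster.
print(kmeans.cluster_centers_)
```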

Calculate the silhouette coefficient for the above clustering

A silhouette coefficient of 0.60 is a fair value. The coefficient ranges from -1 (worst) to +1 (best).
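To make the -1 to +1 range concrete, here is a hand-rolled sketch of the silhouette coefficient using only the standard library. It is an illustration of the definition, not the notebook's code; libraries such as scikit-learn offer `silhouette_score` for the same quantity. For each point, a is the mean distance to the other members of its own cluster, b is the smallest mean distance to any other cluster, and the score is (b - a) / max(a, b). The function name and toy points are hypothetical.

```python
from math import dist

def silhouette_coefficient(points, labels):
    """Mean silhouette over all points; singleton clusters score 0 by convention."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    if len(clusters) < 2:
        raise ValueError("need at least two clusters")

    scores = []
    for p, lab in zip(points, labels):
        own = clusters[lab]
        if len(own) == 1:
            scores.append(0.0)
            continue
        # a: mean distance to the other members of the point's own cluster
        # (the sum includes dist(p, p) = 0, hence the division by len(own) - 1).
        a = sum(dist(p, q) for q in own) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster.
        b = min(
            sum(dist(p, q) for q in members) / len(members)
            for other, members in clusters.items()
            if other != lab
        )
        denom = max(a, b)
        scores.append((b - a) / denom if denom > 0 else 0.0)
    return sum(scores) / len(scores)

# Toy data: two tight, well-separated pairs of points.
points = [(0, 0), (0, 1), (10, 10), (10, 11)]
good = silhouette_coefficient(points, [0, 0, 1, 1])  # sensible labelling
bad = silhouette_coefficient(points, [0, 1, 0, 1])   # deliberately mixed labelling
print(good, bad)
```

The sensible labelling scores close to +1, while the mixed labelling scores below 0, which matches the interpretation of the range given above.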

K-medoids

Install pyclustering in the notebook kernel
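For readers without pyclustering installed, the idea behind k-medoids can be sketched in plain Python. This is a simple alternating (assign, then re-pick medoid) variant written for illustration; it is not pyclustering's implementation and not the notebook's code. The function name, the toy points, and the initial medoid indices are all hypothetical.

```python
from math import dist

def k_medoids(points, initial_medoids, max_iter=100):
    """Alternating k-medoids. Medoids are indices into `points`."""
    medoids = list(initial_medoids)
    clusters = []
    for _ in range(max_iter):
        # Assignment step: each point index joins the cluster of its nearest medoid.
        clusters = [[] for _ in medoids]
        for i, p in enumerate(points):
            nearest = min(range(len(medoids)),
                          key=lambda c: dist(p, points[medoids[c]]))
            clusters[nearest].append(i)
        # Update step: the new medoid of a cluster is the member with the
        # smallest total distance to all other members.
        new_medoids = [
            min(members, key=lambda i: sum(dist(points[i], points[j]) for j in members))
            if members else medoids[c]
            for c, members in enumerate(clusters)
        ]
        if new_medoids == medoids:
            break  # converged: medoids no longer change
        medoids = new_medoids
    return medoids, clusters

# Toy data (assumption): four small groups, one near each corner of a square.
points = [(1, 1), (1, 2), (2, 1),
          (8, 8), (8, 9), (9, 8),
          (1, 8), (2, 9), (1, 9),
          (8, 1), (9, 1), (9, 2)]

# k = 4, starting from one arbitrary point per group.
medoids, clusters = k_medoids(points, initial_medoids=[0, 3, 6, 9])
print("medoid indices:", medoids)
print("clusters:", clusters)
```

Unlike k-means centroids, each medoid is an actual data point, which makes the method less sensitive to outliers and usable with any distance function.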

BIRCH
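A minimal BIRCH sketch, assuming scikit-learn's `Birch` estimator and the same synthetic stand-in data as an assumption for the notebook's dataset. The `threshold` and `branching_factor` values shown are the library defaults, written out for clarity; `n_clusters=4` follows the k chosen earlier.

```python
from sklearn.cluster import Birch
from sklearn.datasets import make_blobs

# Synthetic stand-in for the notebook's dataset (assumption).
X, _ = make_blobs(n_samples=400, centers=4, cluster_std=0.6, random_state=0)

# BIRCH builds a tree of compact subclusters, then groups the subclusters
# into n_clusters final clusters.
birch = Birch(n_clusters=4, threshold=0.5, branching_factor=50)
birch_labels = birch.fit_predict(X)

print(sorted(set(birch_labels.tolist())))
```

Lowering `threshold` produces more, smaller subclusters before the final grouping, which tends to help when clusters are close together.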